Dataflow Java: Implicitly Parallel Java
نویسندگان
چکیده
Dataflow computation models enable simpler and more efficient management of the memory hierarchy a key barrier to the performance of many parallel programs. This paper describes a dataflow language based on Java. Use of the dataflow model enables a programmer to generate parallel programs without explicit directions for message passing, work allocation and synchronisation. A small handful of additional syntactic constructs are required. A pre-processor is used to convert Dataflow Java programs to standard portable Java. The underlying run-time system was easy to implement using Java’s object modelling and communications primitives. Although raw performance lags behind an equivalent Cbased system, we were able to demonstrate useful speedups in a heterogeneous environment, thus amply illustrating the potential power of the Dataflow Java approach to use all machines of whatever type that might be available on a network .. when Java JIT compiler technology matures.
منابع مشابه
Making State Explicit for Imperative Big Data Processing
Data scientists often implement machine learning algorithms in imperative languages such as Java, Matlab and R. Yet such implementations fail to achieve the performance and scalability of specialised data-parallel processing frameworks. Our goal is to execute imperative Java programs in a data-parallel fashion with high throughput and low latency. This raises two challenges: how to support the ...
متن کاملJava Tools and Technologies for Cluster Computing
The Java language and its associated libraries and environment provide a powerful and flexible platform for programming computer clusters. Java tools and technologies enable experimentation in both management aspects as well as performance aspects of cluster systems. We discuss the current interesting problems in cluster computing including those derived from distributed computing as well as th...
متن کاملAchieving Robust, Scalable Cluster I/O in Java
We present Tigris, a high-performance computation and I/O substrate for clusters of workstations that is implemented entirely in Java. Tigris automatically balances resource load across the cluster as a whole, shielding applications from asymmetries in CPU, I/O, and network performance. This is accomplished through the use of a dataflow programming model coupled with a work-balancing distribute...
متن کاملJava on networks of workstations (JavaNOW): a parallel computing framework inspired by Linda and the Message Passing Interface (MPI)
Networks of workstations are a dominant force in the distributed computing arena, due primarily to the excellent price/performance ratio of such systems when compared to traditionally massively parallel architectures. It is therefore critical to develop programming languages and environments that can potentially harness the raw computational power availab le on these systems. In this article, w...
متن کاملA Dataflow Framework for Java and the Checker Framework
class Node class *Node extends Node
متن کامل